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GROWTH DIFFERENTIATION FACTOB-9 

This application is a continuation-in-part application of U.S. Serial No. 
08/003,303, filed January 12, 1993. 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention 

The invention relates generally to growth factors and specifically to a new 
member of the transforming growth factor beta (TGF-/3) superfamily, which is 
denoted, growth differentiation factor-9 (GDF-9). 

2. Description of Related Art 

10 The transforming growth factor p (TGF-£) superfamily encompasses a group 
of structurally-related proteins which affect a wide range of differentiation 
processes during embryonic development. The family includes, Mullerian 
inhibiting substance (MIS), which is required for normal male sex development 
(Behringer, ef a/., Nature, 345:167. 1990), Drosophila decapentaplegic (DPP) 

15 gene product which is required for dorsal-ventral axis formation and. 
morphogenesis of the imaginal disks (Padgett, ef a/., Nature, 325 :81-84. 1987), 
the Xenopus Vg-1 gene product, which localizes to the vegetal pole of eggs 
((Weeks, ef a/., Cell, 51:861-867, 1987), the activins (Mason, et aL, Biochem, 
Biophys. Res. Commun., 135 :957-964. 1986), which can induce the formation 

20 of mesoderm and anterior structures in Xenopus embryos (Thomsen, et ah, 
Cell, 63:485, 1990), and the bone morphogenetic proteins (BMPs, osteogenin, 
OP-1) which can induce de novo cartilage and bone formation (Sampath, et 
aL, J. BioL Chem., 265:13198, 1990). The TGF-^s can influence a variety of 
differentiation processes, including adipogene'sis.-myogenesis, chondrogenesis, 
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hematopoiesis, and epithelial cell differentiation (for review, see Massague, Cell 
49:437. 1987). 

The proteins of the TGF-/3 family are initially synthesized as a large precursor 
protein which subsequently undergoes proteolytic cleavage at a cluster of basic 

5 residues approximately 110-140 amino acids from the C-terminus. The C- 
terminal regions of the proteins are all structurally related and the different 
family members can be classified into distinct subgroups based on the extent 
of their homology. Although the homologies within particular subgroups range 
from 70% to 90% amino acid sequence identity, the homologies between 

0 subgroups are significantly lower, generally ranging from only 20% to 50%. In 
each case, the active species appears to be a disulfide-linked dimer of C- 
terminal fragments. For most of the family members that have been studied, 
the homodimeric species has been found to be biologically active, but for other 
family members, like the inhtbins (Ling, et al., Nature, 321:779, 1 986) and the 

5 TGF-/JS (Cheifetz, et al., Cell, 48:409, 1987), heterodimers have also been 
detected, and these appear to have different biological properties than the 
respective homodimers. 

The inhibins and activins were originally purified from follicular fluid and shown 
to have counteracting effects on the release of follicle-stimulating hormone by 

3 the pituitary gland. Although the mRNAs for all three inhibin/activin subunits 
(aa, 0A and pB) have been detected in the ovary, none of these appear to be 
ovary-specific (Meunier, etal., Proc.Natl.Acad.Sci. USA, 85:247, 1988). MIS has 
also been shown to be expressed by granulosa cells and the effects of MIS on 
ovarian development have been documented both in vivo in transgenic mice 

5 expressing MIS ectopically (Behringer, supra) and in vitro in organ culture 
(Vigier, er al., Development, 10Q_:43 ( 1987). 
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Identification of new factors that are tissue-specific in their expression pattern 
will provide a greater understanding of that tissue's development and function. 
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SUMMARY OF THE INVENTION 

The present invention provides a cell growth and differentiation factor, GDF-9, 
a polynucleotide sequence which encodes the factor and antibodies which are 
irnmunoreactive with the factor. This factor appears to relate to various cell 
5 proliferative disorders, especially those involving ovarian tumors, such as 
granulosa cell tumors. 

Thus, in one embodiment, the invention provides a method for detecting a cell 
proliferative disorder of ovarian origin and which is associated with GDF-9. In 
another embodiment, the invention provides a method of treating a cell 
10 proliferative disorder associated with abnormal levels of expression of GDF-9, 
by suppressing or enhancing GDF-9 activity. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows expression of GDF-9 mRNA in adult tissues. 

FIGURE 2 shows nucleotide and predicted amino acid sequence of murine 
GDF-9. Consensus N-glycosylation signals are denoted by plain boxes. The 
5 putative tetrabasic processing sites are denoted by stippled boxes. The in- 
frame termination codons upstream of the putative initiating ATG and the 
consensus polyadenylation signals are underlined. The poly A tails are not 
shown. Numbers indicate nucleotide position relative to the 5' end. 

FIGURE 3 shows the alignment of the C-ternrinal sequences of GDF-9 with 
10 other members of the TGF-£ family. The conserved cysteine residues are 
shaded. Dashes denote gaps introduced in order to maximize alignment. 

FIGURE 4 shows amino acid homologies among the different members of the 
TGF-£ superfamily. Numbers represent percent amino acid identities between 
each pair calculated from the first conserved cysteine to the C-terminus. Boxes 
15 represent homologies among highly-related members within particular 
subgroups. 

FIGURE 5 shows the immunohistochemical localization of GDF-9 protein. 
Adjacent sections of an adult ovary were either stained with hematoxylin and 
eosin (FIGURE 5a) or incubated with immune (FIGURE 5b) or pre-immune 
20 (FIGURE 5c) serum at a dilution of 1 :500. Anti-GDF-9 antiserum was prepared 
by expressing the C-terminal portion of murine GDF-9 (residues 308-441 ) in 
bacteria, excising GDF-9 protein from preparative SDS gels, and immunizing 
rabbits. Sites of antibody binding were visualized using the Vectastain ABC kit 
(Vector Labs). 
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FIGURE 6 shows a comparison of the predicted amino acid sequences of 
murine (top lines) and human (bottom lines) GDF-9. Numbers represent amino 
acid positions relative to the N-termini. Vertical lines represent sequence 
identities. Dots represent gaps introduced in order to maximize the alignment. 
5 The clear box shows the predicted proteolytic processing sites. The shaded 
boxes show the cysteine residues in the mature region of the proteins. The 
bars at the bottom show a schematic of the pre-(clear) and mature (shaded) 
regions of GDF-9 with the percent sequence identities between the murine and 
human sequences shown below. 

10 FIGURE 7 shows in situ hybridization to adult ovary sections using a GDF-9 
RNA probe. [^SJ-labeled anti-sense (FIGURE 7a and 7c) or sense (FIGURE 
7 b and 7d) GDF-9 RNA probes were hybridized to adjacent paraffin- 
embedded sections of ovaries fixed in 4% paraformaldehyde. Sections were 
dipped in photographic emulsion, exposed, developed, and then stained with 

15 hematoxylin and eosin. Two representative fields, are shown. 

FIGURE 8 shows in situ hybridization to a postnatal day 4 ovary section using 
an antisense GDF-9 RNA probe. Sections were prepared as described for 
FIGURE 7. Following autoradiography and staining, the section was 
photographed under bright-field (FIGURE 8a) or dark-field (FIGURE 8b) 
20 illumination. 

FIGURE 9 shows in situ hybridization to postnatal day 8 ovary sections using 
an antisense (FIGURE 9a) or sense (FIGURE 9b) GDF-9 RNA probe. Sections 
were prepared as described for FIGURE 7. 

FIGURE 10 shows in situ hybridization to adult oviduct sections using an 
25 antisense (FIGURE 10a) or sense (FIGURE 10b) GDF-9 RNA probe. Sections 
were prepared as described for FIGURE 7. 
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RGURE 1 1 shows in situ hybridization to an adult oviduct (0.5 days following 
fertilization) section using an antisense GDF-9 RNA probe. Sections were 
prepared as described for FIGURE 7. Following autoradiography and staining, 
the section was photographed under bright-field (FIGURE 11a) or dark-field 
5 (RGURE 11b) illumination. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a growth and differentiation factor, GDF-9 and 
a polynucleotide sequence encoding GDF-9. Unlike other members of the 
TGF-0 superfamily, GDF-9 expression is highly tissue specific, being expressed 
5 in cells primarily in ovarian tissue. In one embodiment, the invention provides 
a method for detection of a cell proliferative disorder of the ovary, which is 
associated with GDF-9 expression. In another embodiment, the invention 
provides a method for treating a cell proliferative disorder associated with 
abnormal expression of GDF-9 by using an agent which suppresses or 
10 enhances GDF-9 activity. 

The TGF-0 superfamily consists of multifunctionaly polypeptides that control 
proliferation, differentiation, and other functions in many cell types. Many of the 
peptides have regulatory, both positive and negative, effects on other peptide 
growth factors. The structural homology between the GDF-9 protein of this 
15 invention and the members of the TGF-0 family, indicates that GDF-9 is a new 
member of the family of growth and differentiation factors. Based on the 
known activities of many of the other members, it can be expected that GDF-9 
will also possess biological activities that will make it useful as a diagnostic and 
therapeutic reagent. 

For example, another regulatory protein that has been found to have structural 
homology with TGF-£ is inhibin, a specific and potent polypeptide inhibitor of 
the pituitary secretion of FSH. Inhibin has been isolated from ovarian follicular 
fluid. Because of its suppression of FSH, inhibin has potential to be used as 
a contraceptive in both males and females. GDF-9 may possess similar 
biological activity since it is also an ovarian specific peptide.lnhibin has also 
been shown to be useful as a marker for certain ovarian tumors (Lappohn, et 
a/ M N. Engl. J. Med., 321:790, 1989). GDF-9 may also be useful as a marker 



20 



25 
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for identifying primary and metastatic neoplasms of ovarian origin. Similarly, 
GDF-9 may be useful as an indicator of developmental anomalies in prenatal 
screening procedures. 

Another peptide of the TGF-£ family is MIS, produced by the testis and 
5 responsible for the regression of the Mullerian ducts in the male embryo. MIS 
has been show to inhibit the growth of human ovarian cancer in nude mice 
(Donahoe, ef a/., Ann. Surg., 1 94:472. 1981). GDF-9 may function similarly and 
may, therefore, be useful as an anti-cancer agent, such as for the treatment of 
ovarian cancer. 

10 GDF-9 may also function as a growth stimulatory factor and, therefore, be 
useful for the survival of various cell populations in vitro. In particular, if GDF-9 
plays a role in oocyte maturation, it may be useful in in vitro fertilization 
procedures, e.g., in enhancing the success rate. Many of the members of the 
TGF-p family are also important mediators of tissue repair. TGF-0 has been 

15 shown to have marked effects on the formation of collagen and causes a 
striking angiogenic response in the newborn mouse (Roberts, et al., Proc. Natl. 
Acad. ScL USA t 83:4167, 1986). GDF-9 may also have similar activities and 
may be useful in repair of tissue injury caused by trauma or burns for example. 

The term "substantially pure" as used herein refers to GDF-9 which is 
20 substantially free of other proteins, lipids, carbohydrates or other materials with 
which it is naturally associated. One skilled in the art can purify GDF-9 using 
standard techniques for protein purification. The substantially pure polypeptide 
will yield a single major band on a non-reducing polyacrylamide gel. The purity 
of the GDF-9 polypeptide can also be determined by amino-terminal amino 
25 acid sequence analysis. GDF-9 polypeptide includes functional fragments of 
the polypeptide, as long as the activity of GDF-9 remains. Smaller peptides 
containing the biological activity of GDF-9 are included in the invention. 
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The invention provides polynucleotides encoding the GDF-9 protein. These 
polynucleotides include DNA, cDNA and RNA sequences which encode GDF-9. 
It is understood that all polynucleotides encoding all or a portion of GDF-9 are 
also included herein, as long as they encode a polypeptide with GDF-9 activity. 
5 Such polynucleotides include naturally occurring, synthetic, and intentionally 
manipulated polynucleotides. For example, GDF-9 polynucleotide may be 
subjected to site-directed mutagenesis. The polynucleotide sequence for GDF- 
9 also includes antisense sequences. The polynucleotides of the invention 
include sequences that are degenerate as a result of the genetic code. There 
10 are 20 natural amino acids, most of which are specified by more than one 
codon. Therefore, all degenerate nucleotide sequences are included in the 
invention as long as the amino acid sequence of GDF-9 polypeptide encoded 
by the nucleotide sequence is functionally unchanged. 

Specifically disclosed herein is a cDNA sequence for GDF-9 which is 1712 base 
15 pairs in length and contains an open reading frame beginning with a 
methionine codon at nucleotide 29. The encoded polypeptide is 441 amino 
acids in length with a molecular weight of about 49.6 kD, as determined by 
nucleotide sequence analysis. The GDF-9 sequence contains a core of 
hydrophobic amino acids near the N-terminus, suggestive of a signal sequence 
20 for secretion. GDF-9 contains four potential N-glycosylation sites at asparagine 
residues 163, 229, 258, and 325 and a putative tetrabasic proteolytic 
processing site (RRRR) at amino acids 303-306. The mature C-terminal 
fragment of GDF-9 is predicted to be 135 amino acids in length and have an 
unglycosylated molecular weight of about 15.6 kD, as determined by nucleotide 
25 sequence analysis. One skilled in the art can modify, or partially or completely 
remove the glycosyl groups from the GDF-9 protein using standard techniques. 
Therefore, the functional protein or fragments thereof of the invention includes 
glycosylated, partially glycosylated and unglycosylated species of GDF-9. 
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The degree of sequence identity of GDF-9 with known TGF-/? family members 
ranges from a minimum of 21% with Mullerian inhibiting substance (MIS) to a 
maximum of 34% with bone morphogenetic protein-4 (BMP-4). GDF-9 
specifically disclosed herein differs from the known family members in its 
5 pattern of cysteine residues in the C-terminal region. GDF-9 lacks the fourth 
cysteine of the seven cysteines present in other family members; in place of 
cysteine at this position, the GDF-9 sequence contains a serine residue. This 
GDF-9 does not contain a seventh cysteine residue elsewhere in the C-terminal 
region. 

10 Minor modifications of the recombinant GDF-9 primary amino acid sequence 
may result in proteins which have substantially equivalent activity as compared 
to the GDF-9 polypeptide described herein. Such modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the 
polypeptides produced by these modifications are included herein as long as 

15 the biological activity of GDF-9 still exists. Further, deletion of one or more 
amino acids can also result in a modification of the structure of the resultant 
molecule without significantly altering its biological activity. This can lead to the 
development of a smaller active molecule which would have brbader utility. For 
example/one can remove amino or carboxy terminal amino acids which are 

20 not required for GDF-9 biological activity. 

The nucleotide sequence encoding the GDF-9 polypeptide of the invention 
includes the disclosed sequence and conservative variations thereof. The term 
"conservative variation" as used herein denotes the replacement of an amino 
acid residue by another, biologically similar residue. Examples of conservative 
25 variations include the substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, or the substitution of one 
polar residue for another, such as the substitution of arginine for lysine, 
glutamic for aspartic acids, or glutamine for asparagine, and the like. The term 
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"conservative variation" also includes the use of a substituted amino acid in 
place of an unsubstituted parent amino acid provided that antibodies raised to 
the substituted polypeptide also immunoreact with the unsubstituted polypep- 
tide. 

DNA sequences of the invention can be obtained by several methods. For 
example, the DNA can be isolated using hybridization techniques which are 
well known in the art. These include, but are not limited to: 1) hybridization of 
genomic or cDNA libraries with probes to detect homologous nucleotide 
sequences and 2) antibody screening of expression libraries to detect cloned 
DNA fragments with shared structural features. 

Preferably the GDF-9 polynucleotide of the invention is derived from a 
mammalian organism, and most preferably from a mouse, rat, or human. 
Screening procedures which rely on nucleic acid hybridization make it possible 
to isolate any gene sequence from any organism, provided the appropriate 
probe is available. Oligonucleotide probes, which correspond to a part of the 
sequence encoding the protein in question, can be synthesized chemically. 
This requires that short, oligopeptide stretches of amino acid sequence must 
be known. The DNA sequence encoding the protein can be deduced from the 
genetic code, however, the degeneracy of the code must be taken into 
account. It is possible to perform a mixed addition reaction when the 
sequence is degenerate. This includes a heterogeneous mixture of denatured 
double-stranded DNA. For such screening, hybridization is preferably 
performed on either single-stranded DNA or denatured double-stranded DNA. 
Hybridization is particularly useful in the detection of cDNA clones derived from 
sources where an extremely low amount of mRNA sequences relating to the 
polypeptide of interest are present. In other words, by using stringent 
hybridization conditions directed to avoid non-specific binding, it is possible, 
for example, to allow the autoradiographic visualization of a specific cDNA 
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clone by the hybridization of the target DNA to that single probe in the mixture 
which is its complete complement (Wallace, ef a/., NucL Acid Res. t §:879, 
1981). 

The development of specific DNA sequences encoding GDF-9 can also be 
5 obtained by: 1) isolation of double-stranded DNA sequences from the genomic 
DNA; 2) chemical manufacture of a DNA sequence to provide the necessary 
codons for the polypeptide of interest; and 3) in vitro synthesis of a double- 
stranded DNA sequence by reverse transcription of mRNA isolated from a 
eukaryotic donor cell. In the latter case, a double-stranded DNA complement 
10 of mRNA is eventually formed which is generally referred to as cDNA. 

Of the three above-noted methods for developing specific DNA sequences for 
use in recombinant procedures, the isolation of genomic DNA isolates is the 
least common. This is especially true when it is desirable to obtain the 
microbial expression of mammalian polypeptides due to the presence of 
15 introns. 

The synthesis of DNA sequences is frequently the method of choice when the 
entire sequence of amino acid residues of the desired polypeptide product is 
known. When the entire sequence of amino acid residues of the desired 
polypeptide is not known, the direct synthesis of DNA sequences is not 

20 possible and the method of choice is the synthesis of cDNA sequences. 
Among the standard procedures for isolating cDNA sequences of interest is the 
formation of plasmid- or phage-carrying cDNA libraries which are derived from 
reverse transcription of mRNA which is abundant in donor cells that have a 
high level of genetic expression. When used in combination with polymerase 

25 chain reaction technology, even rare expression products can be cloned. In 
those cases where significant portions of the amino acid sequence of the 
polypeptide are known, the production of labeled single or double-stranded 
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DNA or RNA probe sequences duplicating a sequence putatively present in the 
target cDNA may be employed in DNA/DNA hybridization procedures which are 
carried out on cloned copies of the cDNA which have been denatured into a 
single-stranded form (Jay. ef a/., NucL Acid Res., 11:2325, 1983). 

A cDNA expression library, such as lambda gt1 1 , can be screened indirectly 
for GDF-9 peptides having at least one epitope, using antibodies specific for 
GDF-9. Such antibodies can be either polyclonally or monoclonally derived 
and used to detect expression product indicative of the presence of GDF-9 
cDNA. 

DNA sequences encoding GDF-9 can be expressed in vitro by DNA transfer 
into a suitable host cell. "Host cells" are cells in which a vector can be 
propagated and its DNA expressed. The term also includes any progeny of 
the subject host cell. It is understood that all progeny may not be identical to 
the parental cell since there may be mutations that occur during replication. 
However, such progeny are included when the term "host cell" is used. 
Methods of stable transfer, meaning that the foreign DNA is continuously 
maintained in the host, are known in the art. 

In the present invention, the GDF-9 polynucleotide sequences may be inserted 
into a recombinant expression vector. The term "recombinant expression 
vector" refers to a plasmid, virus or other vehicle known in the art that has 
been manipulated by insertion or incorporation of the GDF-9 genetic sequenc- 
es. Such expression vectors contain a promoter sequence which facilitates the 
efficient transcription of the inserted genetic sequence of the host. The 
expression vector typically contains an origin of replication, a promoter, as well 
as specific genes which allow phertotypic selection of the transformed cells. 
Vectors suitable for use in the present invention include/but are not limited to 
the T7-based expression vector for expression in bacteria (Rosenberg, ef a/.. 
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Gene ,56:125, 1987), the pMSXND expression vector for expression in 
mammalian cells (Lee and Nathans, J, Biol. Chem., 263:3521, 1988) and 
baculovirus-derived vectors for expression in insect cells. The DNA segment 
can be present in the vector operably linked to regulatory elements, for 
5 example, a promoter (e.g., T7, metailothionein l f or polyhedrin promoters). 

Polynucleotide sequences encoding GDF-9 can be expressed in either 
prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and 
mammalian organisms. Methods of expressing DNA sequences having 
eukaryotic or viral sequences in prokaryotes are well known in the art. 
10 Biologically functional viral and plasmid DNA vectors capable of expression and 
replication in a host are known in the art. Such vectors are used to incorp- 
orate DNA sequences of the invention. 

Transformation of a host cell with recombinant DNA may be carried out by 
conventional techniques as are weir known to those skilled in the art. Where 
'15 the host is prokaryotic, such as E. co//, competent cells which are capable of 
DNA uptake can be prepared from cells harvested after exponential growth 
phase and subsequently treated by the CaCI 2 method using procedures well 
known in the art. Alternatively, MgCI 2 or RbCI can be used. Transformation 
can also be performed after forming a protoplast of the host cell if desired. 

20 When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechanical procedures such as 
microinjection, electro poration, insertion of a plasmid encased in liposomes, or 
virus vectors may be used. Eukaryotic cells can also be cotransformed with 
DNA sequences encoding the GDF-9 of the invention, and a second foreign 

25 DNA molecule encoding a selectable phenotype, such as the herpes simplex 
thymidine kinase gene. Another method is to use a eukaryotic viral vector, 
such as simian virus 40 (SV40) or bovine papilloma virus, to transiently infect 
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or transform eukaryotic cells and express the protein, (see for example, 
Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982). 

Isolation and purification of microbial expressed polypeptide, or fragments 
thereof, provided by the invention, may be carried out by conventional means 
including preparative chromatography and immunological separations involving 
monoclonal or polyclonal antibodies. 

The invention includes antibodies immunoreactive with GDF-9 polypeptide or 
functional fragments thereof. Antibody which consists essentially of pooled 
monoclonal antibodies with different epitopic specificities, as well as distinct 
monoclonal antibody preparations are provided. Monoclonal antibodies are 
made from antigen containing fragments of the protein by methods well known 
to those skilled in the art (Kohler, ef a/.. Nature, 25§:495, 1975). The term 
antibody as used in this invention is meant to include intact molecules as well 
as fragments thereof, such as Fab and F(ab') 2 , which are capable of binding 
an epitopic determinant on GDF-9. 

The term "cell-proliferative disorder" denotes malignant as well as non-malignant 
cell populations which often appear to differ from the surrounding tissue both 
morphologically and genotypicaliy. The GDF-9 polynucleotide that is an 
antisense molecule is useful in treating malignancies of the various organ 
systems, particularly, for example, the ovaries. Essentially, any disorder which 
is etiologically linked to altered expression of GDF-9 could be considered 
susceptible to treatment with a GDF-9 suppressing reagent. 

The invention provides a method for detecting a cell proliferative disorder of the 
ovary which comprises contacting an anti-GDF-9 antibody with a cell suspected 
of having a GDF-9 associated disorder and detecting binding to the antibody. 
The antibody reactive with GDF-9 is labeled with a compound which allows 
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detection of binding to GDF-9. For purposes of the invention, an antibody 
specific for GDF-9 polypeptide may be used to detect the level of GDF-9 in 
biological fluids and tissues. Any specimen containing a detectable amount of 
antigen can be used. A preferred sample of this invention is tissue of ovarian 
5 origin, specifically tissue containing granulosa cells or ovarian follicular fluid. 
The level of GDF-9 in the suspect cell can be compared with the level in a 
normal cell to determine whether the subject has a GDF-9-associated cell 
proliferative disorder. Preferably the subject is human. 

The antibodies of the invention can be used in any subject in which it is 
10 desirable to administer in vitro or in vivo immunodiagnosis or immunotherapy. 
The antibodies of the invention are suited for use, for example, in immuno- 
assays in which they can be utilized in liquid phase or bound to a solid phase 
carrier. In addition, the antibodies in these immunoassays can be detectably 
labeled in various ways. Examples of types of immunoassays which canutilize 
15 antibodies of the invention are competitive and non-competitive immunoassays 
in either a direct or indirect format. Examples of such immunoassays are the 
radioimmunoassay (RIA) and the sandwich (immunometric) assay. Detection 
of the antigens using the antibodies of the invention can be done utilizing 
immunoassays which are run in either the forward, reverse, or simultaneous 
20 modes, including immunohistochemical assays on physiological samples. 
Those of skill in the art will know, or can readily discern, other immunoassay 
formats without undue experimentation. 

The antibodies of the invention can be bound to many different carriers and 
used to detect the presence of an antigen comprising the polypeptide of the 
25 invention. Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier 
can be either soluble or insoluble for purposes of the invention. Those skilled 
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in the art will know of other suitable carriers for binding antibodies, or will be 
able to ascertain such, using routine experimentation. 

There are many different labels and methods of labeling known to those of > 
ordinary skill in the art. Examples of the types of labels which can be used in 
5 the present invention include enzymes, radioisotopes, fluorescent compounds, 
colloidal metals, chemiluminescent compounds, phosphorescent compounds, 
and bioluminescent compounds. Those of ordinary skill in the art will know of 
other suitable labels for binding to the antibody, or will be able to ascertain! 
such, using routine experimentation. 

10 Another technique which may also result in greater sensitivity consists of 
coupling the antibodies to low molecular weight haptens. These haptens can 
then be specifically detected by means of a second reaction. For example, it 
is common to use such haptens as biotin, which reacts with avidin, or 
dinitrophenyl, puridoxal, and fluorescein, which can react with specific anti- 

1 5 hapten antibodies. 

In using the monoclonal antibodies of the invention for the in vivo detection of 
antigen, the detectably labeled antibody is given a dose which is diagnostically 
effective. The term "diagnostically effective" means that the amount of 
detectably labeled monoclonal antibody is administered in sufficient quantity to 
20 enable detection of the site having the antigen comprising a polypeptide of the 
invention for which the monoclonal antibodies are specific. 

The concentration of detectably labeled monoclonal antibody which is 
adminstered should be sufficient such that the binding to those cells having the 
polypeptide is detectable compared to the background. Further, it is desirable 
25 that the detectably labeled monoclonal antibody be rapidly cleared from the 
circulatory system in order to give the best target-to-background signal ratio. 
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As a rule, the dosage of detectably labeled monoclonal antibody for in vivo 
diagnosis will vary depending on such factors as age, sex, and extent of 
disease of the individual. Such dosages may vary, for example, depending on 
whether multiple injections are given, antigenic burden, and other factors 
5 known to those of skill in the art. 

For in vivo diagnostic imaging, the type of detection instrument available is a 
major factor in selecting a given radioisotope. The radioisotope chosen must 
have a type of decay which is detectable for a given type of instrument. Still 
another important factor in selecting a radioisotope for in vivo diagnosis is that 
10 deleterious radiation with respect to the host is minimized. Ideally, a radio- 
isotope used for in vivo imaging will lack a particle emission, but produce a 
large number of photons in the 140-250 keV range, which may readily be 
detected by conventional gamma cameras. 

For in vivo diagnosis radioisotopes may be bound to immunoglobulin either 
15 directly or indirectly by using an intermediate functional group. Intermediate 
functional groups which often are used to bind radioisotopes which exist as 
metallic ions to immunoglobulins are the bifunctional chelating agents such as 
diethylenetriaminepentacetic acid (DTP A) and ethyienediaminetetraacetic acid 
(EDTA) and similar molecules. Typical examples of metallic ions which can be 
20 bound to the monoclonal antibodies of the invention are 1n ln, 97 Ru, 67 Ga, K Ga, 
72 As, 89 Zr, and^TI. 

The monoclonal antibodies of the invention can also, be labeled with a 
paramagnetic isotope for purposes of in vivo diagnosis, as in magnetic 
resonance imaging (MRI) or electron spin resonance (ESR). In general, any 
25 conventional method for visualizing diagnostic imaging can be utilized. Usually 
gamma and positron emitting radioisotopes are used for camera imaging and 
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paramagnetic isotopes for MRI. Elements which are particularly useful in such 
techniques include 157 Gd, ^Mn, 16 ?Dy, 52 Cr, and ^Fe. 

The monoclonal antibodies of the invention can be used in vitro and in vivo to 
monitor the course of amelioration of a GDF-9-associated disease in a subject. 
5 Thus, for example, by measuring the increase or decrease in the number of 
cells expressing antigen comprising a polypeptide of the invention or changes 
in the concentration of such antigen present in various body fluids, it would be 
possible to determine whether a particular therapeutic regimen aimed at 
ameliorating the GDF-9-associated disease is effective. The term "ameliorate" 
10 denotes a lessening of the detrimental effect of the GDF-9-associated disease 
in the subject receiving therapy. 

The present invention identifies a nucleotide sequence that can be expressed 
in an altered manner as compared to expression in a normal cell, therefore, it 
is possible to design appropriate therapeutic or diagnostic techniques directed 

15 . to this sequence. Thus, where a cell-proliferative disorder is associated with 
the expression of GDF-9, nucleic acid sequences that interfere with GDF-9 
expression at the translational level can be used. This approach utilizes, for 
example, antisense nucleic acid and ribozymes to block translation of a specific 
GDF-9 mRNA, either by masking that mRNA with an antisense nucleic acid or 

20 by cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to 
at least a portion of a specific mRNA molecule (Weintraub, Scientific American. 
262:40, 1990). In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA, forming a double-stranded molecule. The antisense 
25 nucleic acids interfere with the translation of the mRNA, since the cell will not 
translate a mRNA that is double-stranded. Antisense oligomers of about 1 5 
nucleotides are preferred, since they are easily synthesized and are less likely 



WO 94/15966 



PCT/US94/00685 



-21- 

to cause problems than larger molecules when introduced into the target GDF- 
9-producing cell. The use of ahtisense methods to inhibit the in vitro 
translation of genes is well known in the art (Marcus-Sakura, AnaLBiochem., 
172:289, 1988). 

5 Ribozymes are RNA molecules possessing the ability to specifically cleave 
other single-stranded RNA in a manner analogous to DNA restriction 
endonucleases. Through the modification of nucleotide sequences which 
encode these RNAs, it is possible to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech, JAmer.Med. 
10 Assn., 260 :3030. 1988). A major advantage of this approach is that, because 
they are sequence-specific, only mRNAs with particular sequences are 
inactivated. 

There are two basic types of ribozymes namely, tetrahyrnena-type (Hasselhoff, 
Nature, 334 :585. 1988) and "hammerhead"-type. Tetrahymena-type ribozymes 

15 recognize sequences which are four bases in length, while "hammerheadMype 
ribozymes recognize base sequences 11-18 bases in length. The longer the 
recognition sequence, the greater the likelihood that the sequence will occur 
exclusively in the target mRNA species. Consequently, hammerhead-type 
ribozymes are preferable to tetrahymena -type ribozymes for inactivating a 

20 specific mRNA species and 18-based recognition sequences are preferable to 
shorter recognition sequences. 

The present invention also provides gene therapy for the treatment of cell 
proliferative disorders which are mediated by GDF-9 protein. Such therapy 
would achieve its therapeutic effect by introduction of the GDF-9 antisense 
25 polynucleotide into cells having the proliferative disorder. Delivery of antisense 
GDF-9 polynucleotide can be achieved using a recombinant expression vector 
such as a chimeric virus or a colloidal dispersion system. 
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Especially preferred for therapeutic delivery of antisense sequences is trie use 
of targeted liposomes. 

Various viral vectors which can be utilized for gene therapy as taught herein 
include adenovirus, herpes virus, vaccinia; or, preferably, an RNA virus such 
as a retrovirus. Preferably, the retroviral vector is a derivative of a mume or 
avian retrovirus. Examples of retroviral vectors in which a single foreign gene 
can be inserted include, but are not limited to: Moloney murine leukemia virus 
(MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor 
virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of additional 
retroviral vectors can incorporate multiple genes. All of these vectors can 
transfer or incorporate a gene for a selectable marker so that transduced cells 
can be identified and generated. By inserting a GDF-9 sequence of interest 
into the viral vector, along with another gene which encodes the ligand for a 
receptor on a specific target cell, for example, the vector is now target specific. 
Retroviral vectors can be made target specific by inserting, for example, a 
polynucleotide encoding a sugar, a glycolipid, or a protein. Preferred targeting 
is accomplished by using an antibody to target the retroviral vector. Those of 
skill in the art will know of, or can readily ascertain without undue experimenta- 
tion, specific polynucleotide sequences which can be inserted into the retroviral 
genome to allow target specific delivery of the retroviral vector containing the 
GDF-9 antisense polynucleotide. ^ 

Since recombinant retroviruses are defective, they require assistance in order 
to produce infectious vector particles. This assistance can be provided, for 
example, by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences 
within the LTR. These plasmids are missing a nucleotide sequence which 
enables the packaging mechanism to recognize an RNA transcript for 
encapsidation. Helper cell lines which have deletions of the packaging signal 
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include, but are not limited to *2, PA317 and PA12, for example. These cell 
lines produce empty virions, since no genome is packaged. If a retroviral 
vector is introduced into such cells in which the packaging signal is intact, but 
the structural genes are replaced by other genes of interest, the vector can be 
5 packaged and vector virion produced. 

Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected 
with plasmids encoding the retroviral structural genes gag, pol and env, by 
conventional calcium phosphate transfection. These cells are then transfected 
with the vector plasmid containing the genes of interest. The resulting cells 
10 release the retroviral vector into the culture medium. 

Another targeted delivery system for GDF-9 antisense polynucleotides is a 
colloidal dispersion system. Colloidal dispersion systems include macromole- 
cule complexes, nanocapsules, microspheres, beads, and lipid-based systems^ 
including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 

15 preferred colloidal system of this invention is a liposome. Liposomes are 
artificial membrane vesicles which are useful as delivery vehicles in vitro and 
in vivo. It has been shown that large unilamellar vesicles (LUV), which range 
in size from 0.2-4.0 ^m can encapsulate a substantial percentage of an 
aqueous buffer containing large macromolecules. RNA, DNA and intact virions 

20 can be encapsulated within the aqueous interior and be delivered to cells in a 
biologically active form (Fraley,ef aL, Trends Biochem. Sc/\ ( 6:77, 1981). In 
addition to mammalian cells, liposomes have been used for delivery of 
polynucleotides in plant, yeast and bacterial cells. In order for a liposome to 
be an efficient gene transfer vehicle, the following characteristics should be 

25 present: (1) encapsulation of the genes of interest at high efficiency while not 
compromising their biological activity; (2) preferential and substantial binding 
to a target cell in comparison to non-target cells; (3) delivery of the aqueous 
contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) 
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accurate and effective expression of genetic information (Mannino, et a/., 
Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
particularly high-phase-transition-temperature phospholipids, usually in 
combination with steroids, especially cholesterol. Other phospholipids or other 
lipids may also be used. The physical characteristics of liposomes depend on 
pH, ionic strength, and the presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl 
compounds, such as phosphatidylglycerol, phosphatidylcholine, phos- 
phatidylserine, phosphatidylethanolamine, sphingolipids, cerebrosides, and 
gangliosides. Particularly useful are diacylphosphatidylglycerols, where the lipid 
moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon 
atoms, and is saturated. Illustrative phospholipids include egg phosphatidyl- 
choline, dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine. 

The targeting of liposomes can be classified based on anatomical and 
mechanistic factors. Anatomical classification is based on the level of 
selectivity, for example, organ-specific, cell-specific, and organelle-specific. 
Mechanistic targeting can be distinguished based upon whether it is passive 
or active. Passive targeting utilizes the natural tendency of liposomes to 
distribute to cells of the reticulo-endpthelial system (RES) in organs which 
contain sinusoidal capillaries. Active targeting, on the other hand, involves 
alteration of the liposome by coupling the liposome to a specific ligand such 
as a monoclonal antibody, sugar, glycolipid, or protein, or by changing the 
composition or size of the liposome in order to achieve targeting to organs and 
cell types other than the naturally occurring sites of localization. 
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The surface of the targeted delivery system may be modified in a variety of 
ways. In the case of a liposomal targeted delivery system, lipid groups can be 
incorporated into the lipid bilayer of the liposome in order to maintain the 
targeting ligand in stable association with the liposomal bilayer. Various linking 
5 groups can be used for joining the lipid chains to the targeting ligand. 

Due to the expression of GDF-9 in the reproductive tract, there are a variety of 
applications using the polypeptide, polynucleotide and antibodies of the 
invention, related to contraception, fertility and pregnancy. GDF-9 could play 
a role in regulation of the menstrual cycle and, therefore, could be useful in 
10 various contraceptive regimens. 

The following examples are intended to illustrate but not limit the invention. 
While they are typical of those that might be used, other procedures known to 
those skilled in the art may alternatively.be used. 
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EXAMPLE 1 

IDENTIFICATION AND ISOLATION OF A NOVEL 
TGF-fl FAMILY MEMBER 

To identify a new member of the TGF-/9 superfamily, degenerate oligonucleoti- 
des were designed which corresponded to two conserved regions among the 
known family members: one region spanning the two tryptophan residues 
conserved in all family members except MIS and the other region spanning the 
invariant cysteine residues near the C-terminus. These primers were used for 
polymerase chain reactions on mouse genomic DNA followed by subcloning 
the PCR products using restriction sites placed at the 5" ends of the primers, 
picking individual E. coli colonies carrying these subcloned inserts, and using 
a combination of random sequencing and hybridization analysis to eliminate 
known members of the superfamily. 

GDF-9 was identified from a mixture of PCR products obtained with the primers 
SJL1 60 (5'-CCGGAATTCGGITGG(G/C/A)A(G/A/T/C)(G/C/A)A(G/A/T/C) 
TGG(A/G)TI(A/G)TI(T/G)CICC-3') (SEQUENCE ID NO. 1) and SJL153 (5'- 

CCGGAATTC(A/G)CAI(G/C)C(A/G)CAIC(T/C)(G/A/T- 
/C)(C/G/T)TIG(T/C)l(G/A)Cr/C)CAT-3') (SEQUENCE ID NO. 2). PCR using 
these primers was carried out with 2 ng mouse genomic DNA at 94 *C for 1 
min, 50° C for 2 min, and 72° C for 2 min for 40 cycles. 

PCR products of approximately 280 bp were gel-purified, digested with Eco Rl, 
gel-purified again, and subcloned in the Bluescript vector (Stratagene, San 
Diego, CA). Bacterial colonies carrying individual subclones were picked into 
96 well microtiter plates, and multiple replicas were prepared by plating the 
cells onto nitrocellulose. The replicate filters were hybridized to probes 
representing known members of the family, and DNA was prepared from non- 
hybridizing colonies for sequence analysis. 
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The primer combination of SJL160 and SJL153, yielded three known 
sequences (inhibin pB, BMP-2, and BMP-4) and one novel sequence 
(designated GDF-9) among 145 subclones analyzed. 

RNA isolation and Northern analysis were carried out as described previously 
5 (Lee.SJ., Mot. Endocrinol. 4:1034, 1990). An oligo dT-primed cDNA library 
was prepared from 2.5-3 m9 of ovary poly A-selected RNA in the lambda ZAP 
II vector according to the instructions provided by Stratagene* The ovary 
library was not amplified prior to screening. Filters were hybridized as 
described previously (Lee, S.-J M Proc. Natl. Acad. ScL USA., 88:4250-4254, 
10 1991). DNA sequencing of both strands was carried out using the dtdeoxy 
chain termination method (Sanger, ef at., Proc. NatL Acad. ScL, USA, 74:5463- 
5467, 1977) and a combination of the S1 nuclease/exonuclease III strategy 
(Henikoff, S. t Gene, 28:351-359, 1984) and synthetic oligonucleotide primers. 

EXAMPLE 2 

15 EXPRESSION PATTERN AND SEQUENCE OF GDF-9 

To determine the expression pattern of GDF-9, RNA samples prepared from 
a variety of adult tissues were screened by Northern analysis. Five micrograms 
of twice polyA-selected RNA prepared from each tissue were electrophoresed 
on formaldehyde gels, blotted and probed with GDF-9. As shown in Figure 1, 
20 ; the GDF-9 probe detected a 1.7 kb mRNA expressed exclusively in the ovary. 

A mouse ovary cDNA library of 1 .5 x 10 6 recombinant phage was constructed 
in lambda ZAP II and screened with a probe derived from the GDF-9 PCR 
product. The nucleotide sequence of the longest of nineteen hybridizing 
clones is shown in Figure 2. Consensus N-glycosylation signals are denoted 
25 by plain boxes. The putative tetrabasic processing sites are denoted by 
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stippled boxes. The in-frame termination codons upstream of the putative 
initiating ATG and the consensus polyadenylation signals are underlined. The 
poly A tails are not shown. Numbers indicate nucleotide position relative to the 
5' end. The 1712 bp sequence contains a long open reading frame beginning 
with a methionine codon at nucleotide 29 and potentially encoding a protein 
441 amino acids in length with a molecular weight of 49.6 kD. Like other 
TGF-/9 family members, the GDF-9 sequence contains a core of hydrophobic 
amino acids near the N-terminus suggestive of a signal sequence for secretion. 
GDF-9 contains four potential N-glycosylation sites at asparagine residues 1 63, 
229, 258, and 325 and a putative tetrabasic proteolytic processing site (RRRR) 
at amino acids 303-306. The mature C-terminal fragment of GDF-9 is predicted 
to be 135 amino acids in length and have an unglycosylated molecular weight 
of 15.6 kD. 

Although the C-terminal portion of GDF-9 clearly shows homology with the 
other family members, the sequence of GDF-9 is significantly diverged from 
those of the other family members (Figures 3 and 4). Figure 3 shows the 
alignment of the C-termina! sequences of GDF-9 with the corresponding 
regions of human GDF-1 (Lee, Proc. Natl. Acad. Sci. USA, 88:4250-4254, 
1991), Xenopus Vg-1 (Weeks, ef a/., Cell, 51:861-867, 1987), human Vgr-1 
(Celeste, et al., Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990), human OP-1 
(Ozkaynak. ef a/., EMBO J., 9:2085-2093, 1990), human BMP-5 (Celeste, et al., 
Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990), Drosophila 60A (Wharton, et 
al., Proc. Natl. Acad. Sci. USA, 88:9214-9218, 1991), human BMP-2 and 4 
(Wozney, ef al., Science, 242:1528-1534, 1988). Drosophila DPP (Padgett, ef 
al., Nature, 325:81-84, 1987), human BMP-3 (Wozney, et al., Science, 
242:1528-1534, 1988), human MIS (Cate, etal., Cell, 45:685-698, 1986), human 
inhibin , ^A, and pB (Mason, etal., Biochem, Biophys. Res. Commun., 135:957- 
964, 1986), human TGF-^1 (Derynck, ef al., Nature, 316:701-705. 1985), 
humanTGF-^2 (deMartin, ef al., EMBO J., g:3673-3677, 1987), human TGF-/33 
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(ten Dijke, et a/., Proc. Natl. Acad. Sci. USA, 85:4715-4719, 1988), chicken TGF- 
04 (Jakowlew, et ai t MoL Endocrinol., 2:11 86-1 195 t 1988), and Xenopus TGF- 
p5 (Kondaiah, ef at., J: Biol. Chem., 265:1089-1093, 1990). The conserved 
cysteine residues are shaded. Dashes denote gaps introduced in order to 
5 maximize the alignment. 

Figure 4 shows the amino acid homologies among the different members of 
' the TGF-/9 superfamily. Numbers represent percent amino acid identities 
between each pair calculated from the first conserved cysteine to the 
C-terminus. Boxes represent homologies among highly-related members within 
10 particular subgroups. 

the degree of sequence identify with known family members ranges from a 
minimum of 21% with MIS to a maximum of 34% with BMP-4. Hence, GDF-9 
is comparable to MIS in its degree of sequence divergence from the other 
members of this superfamily. Moreover, GDF-9 shows no significant sequence 

15 homology to other family members in the pro-region of the molecule. GDF-9 
also differs from the known* family members in its pattern of cysteine residues 
in the C-terminal region. GDF-9 lacks the fourth cysteine of the seven 
cysteines that are present in all other family members; in place of cysteine at 
this position, the GDF-9 sequence contains a serine residue. In addition, GDF- 

20 9 does not contain a seventh cysteine residue elsewhere in the C-terminal 
region. 
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EXAMPLE 3 

IMMUNOCHEMICAL LOCALIZATION OF GDF-9 
IN THE ZONA PELLUCIDA 

To determine whether GDF-9 mRNA was translated, sections of adult ovaries 
were incubated with antibodies directed against recombinant GDF-9 protein. 
In order to raise antibodies against GDF-9, portions of GDF-9 cDNA spanning 
amino acids 30 to 295 (pro-region) or 308 to 441 (mature region) were cloned 
into the T7-based pET3 expression vector (provided by F.W. Studier, 
Brookhaven National Laboratory), and the resulting plasmids were transformed 
into the BL21 (DE3) bacterial strain. Total cell extracts from isopropyl b-D- 
thiogalactoside-induced cells were electrophoresed on SDS/polyacrylamide 
gels, and the GDF-9 protein fragments were excised, mixed with Freund's 
adjuvant, and used to immunize rabbits by standard methods known to those 
of skill in the art. All immunizations were carried out by Spring Valley Lab 
(Sykesville, MD). The presence of GDF-9-reactive antibodies in the sera of 
these rabbits was assessed by Western analysis of bacterially-expressed 
protein fragments. The resulting serum was shown to react with the bacterially- 
expressed protein by Western analysis. 

For immunohistochemical studies, ovaries were removed from adult mice, fixed 
in 4% paraformaldehyde, embedded in paraffin, and sectioned. Sites of 
antibody binding were detected by using the Vectastain ABC kit, according to 
the instructions provided by Vector Laboratories. FIGURE 5 shows the 
immunohistochemical localization of GDF-9 protein. Adjacent sections of an 
adult ovary were either stained with hematoxylin and eosin (FIGURE 5a) or 
incubated with immune (FIGURE 5b) or pre-immune (FIGURE 5c) serum at a 
dilution of 1 :500. As shown in FIGURE 5b, the antiserum detected protein 
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solely in oocytes. No staining was detected using pre-imrnune serum (FIGURE 
5c). Hence, GDF-9 protein appears to translated in vivo by oocytes. 

EXAMPLE 4 
ISOLATION OF HUMAN GDF-9 

5 in order to isolate a cDNA clone encoding human GDF-9, a cDNA library was 
constructed in lambda ZAP II using poly A-selected RNA prepared from an 
adult human ovary. From this library, a cDNA clone containing the entire 
human GDF-9 coding sequence was identified using standard screening 
techniques as in Example 1 and using the murine GDF-9 clone as a probe. 

10 A comparison of the predicted amino acid sequences of murine (top lines) and 
.... human (bottom lines) GDF-9 is shown in FIGURE 6. Numbers represent amino t 
acid positions relative to the N-terminL Vertical lines represent sequence 
identities. Dots represent gaps introduced in order to maximize the alignment.^ 
The clear box shows the predicted proteolytic processing sites. The shaded 

15 _ boxes show the cysteine residues in the mature region of the proteins. The , 
bars at the bottom show a schematic of the pre- (clear) and mature (shaded) 
regions of GDF-9 with the percent sequence identities between the murine and, 
human sequences shown below. 

Like murine GDF-9, human GDF-9 contains a hydrophobic leader sequence, 
20 a putative RXXR proteolytic cleavage site, and a C-terminal region containing 
the hallmarks of other TGF-£ family members. Murine and human GDF-9 are 
64% identical in the pro- region and 90% identical in the predicted mature 
region of the molecule. The high degree of homology between the two 
sequences suggests that human GDF-9 plays an important role during 
25 embryonic development and/or in the adult ovary. 
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EXAMPLE 5 

NUCLEI C ACID DETECTION OF EXPRESSION OF GDF-9 IN OOCYTES 

In order to localize the expression of GDF-9 in the ovary, in situ hybridization 
to mouse ovary sections was carried out using an antisense GDF-9 RNA 
probe. FIGURE 7 shows in situ hybridization to adult ovary sections using a 
GDF-9 RNA probe. [^SJ-labeled anti-sense (FIGURE 7a and 7c) or sense 
(FIGURE 7 b and 7d) GDF-9 RNA probes were hybridized to adjacent paraffin- 
embedded sections of ovaries fixed in 4% paraformaldehyde. Sections were 
dipped in photographic emulsion, exposed, developed, and then stained with 
hematoxylin and eosin. Two representative fields are shown. 

As shown in FIGURES 7a and 7c, GDF-9 mRNA was detected primarily in 
oocytes in adult ovaries. Every oocyte (regardless of the stage of follicular 
development) examined showed GDF-9 expression, and no expression was 
detected in any other cell types. No hybridization was seen using a control 
GDF-9 sense RNA probe (FIGURE 7b and 7d). Hence, GDF-9 expression 
appears to be oocyte-specific in adult ovaries. 

To determine the pattern of expression of GDF-9 mRNA during ovarian 
development, sections of neonatal ovaries were probed with a GDF-9 RNA 
probe. FIGURE 8 shows in situ hybridization to a postnatal day 4 ovary 
section using an antisense GDF-9 RNA probe. Sections were prepared as 
described for FIGURE 7. Following autoradiography and staining, the section 
was photographed under bright-field (FIGURE 8a) or dark-field (FIGURE 8b) 
illumination. 



FIGURE 9 shows in situ hybridization to postnatal day 8 ovary sections using 
an antisense (FIGURE 9a) or sense (FIGURE 9b) GDF-9 RNA probe. Sections 
were prepared as described for FIGURE 7. 
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GDF-9 mRNA expression was first detected at the onset of follicular 
development. This was most clearly evident at postnatal day 4, where only 
oocytes that were present in follicles showed GDF-9 expression (FIGURE 8); 
no expression was seen in oocytes that were not surrounded by granulosa 
5 cells. By postnatal day 8, every oocyte appeared to have undergone follicular 
development, and every oocyte showed GDF-9 expression (FIGURE 9). 

To determine whether GDF-9 was also expressed following ovulation, sections 
of mouse oviducts were examined by in situ hybridization. FIGURE 10 shows 
in situ hybridization to adult oviduct sections using an antisense (FIGURE 10a) 
10 or sense (FIGURE 10b) GDF-9 RNA probe. Sections were prepared as 
described for FIGURE 7. 

FIGURE 11 shows in situ hybridization to an adult oviduct (0.5 days following 
fertilization) section using an antisense GDF-9 RNA probe. Sections were 
prepared as described for FIGURE 7. Following autoradiography and staining, 
15 the section was photographed under bright-field (FIGURE 11a) or dark-field , 
(FIGURE 11b) illumination. 

As shown in FIGURE 10, GDF-9 was expressed by oocytes that had been 
released into the oviduct. However, the expression of GDF-9 mRNA turned off 
rapidly following fertilization of the oocytes; by day 0.5 following fertilization, 
20 only some embryos (such as the one shown in FIGURE 1 1) expressed GDF-9 
mRNA, and by day '1.5, all embryos were negative for GDF-9 expression. 

Although the invention has been described with reference to the presently 
preferred embodiment, it should be understood that various modifications can 
be made without departing from the spirit of the invention. Accordingly, the 
25 invention is limited only by the following claims. 
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SUMMARY OF SEQUENCES 

Sequence ID No. 1 is the nucleotide sequence for the primer, SJL1 60, for GDF- 
9 (page 24, lines 15 and 16); 

Sequence ID No. 2 is the nucleotide sequence for the primer, SJL153, for GDF- 
9 (page 24, lines 17 and 18); 

Sequence ID No. 3 is the nucleotide and deduced amino acid sequence for 
GDF-9 (Figure 2); 

Sequence ID No. 4 is the deduced amino acid sequence for GDF-9 (Figure 2); 

Sequence ID No. 5 is the amino acid sequence of the C-terminus of GDF-3 
(Figure 3); 

Sequence ID No. 6 is the amino acid sequence of the C-terminus of GDF-9 
(Figure 3); 

Sequence ID No. 7 is the amino acid sequence of the C-terminus of GDF-1 
(Figure 3); 

Sequence ID No. 8 is the amino acid sequence of the C-terminus of Vg-1 
(Figure 3); 

Sequence ID No. 9 is the amino acid sequence of the C-terminus of Vgr-1 
(Figure 3); 

Sequence ID No. 10 is the amino acid sequence of the C-terminus of OP-1 
(Figure 3); 
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Sequence ID No. 1 1 is the amino acid sequence of the C-terminus of BMP-5 
(Figure 3); 

Sequence ID No. 12 is the amino acid sequence of the C-terminus of 60A 
(Figure 3); 

5 Sequence ID No. 13 is the amino acid sequence of the C-terminus of BMP-2 
(Figure 3); 

Sequence ID No. 14 is the amino acid sequence of the C-terminus of BMP-4 
(Figure 3); 

Sequence ID No. 15 is the amino acid sequence of the C-terminus of DPP 
10 (Figure 3); 

Sequence ID No. 16 is the amino acid sequence of the C-terminus of BMP-3 
J (Figure 3); 

Sequence ID No. 17 is the amino acid sequence of the C-terminus of MIS 
(Figure 3); 

1 5 Sequence ID No. 1 8 is the amino acid sequence of the C-terminus of inhibin 
a (Figure 3); 

Sequence ID No. 1 9 is the amino acid sequence of the C-terminus of inhibin 
£A (Figure 3); 

Sequence ID No. 20 is the amino acid sequence of the C-terminus of inhibin 
20 pB (Figure 3); 
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Sequence ID No. 21 is the amino acid sequence of the C-terminus of TGF-01 
(Figure 3); 

Sequence ID No. 22 is the amino acid sequence of the C-terminus of TGF-#2 
(Figure 3); 

Sequence ID No. 23 is the amino acid sequence of the C-terminus of TGF-^3 
(Figure 3); 

Sequence ID No. 24 is the amino acid sequence of the C-terminus of TGF-^4 
(Figure 3); 

Sequence ID No. 25 is the amino acid sequence of the C-terminus of TGF-/95 
(Figure 3); and 

Sequence ID No. 26 is the amino acid sequence of human GDF-9 (Figure 6). 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: THE JOHNS HOPKINS UNIVERSITY 
(ii) TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR- 9 
5 (iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Spensley Horn Jubas & Lubitz 

(B) STREET: 1880 Century Park East, Suite 500 
10 (C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 90067 

(v) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: 

(B) FILING DATE: 12 -JAN -19 94 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Wetherell, Jr. Ph.D., John R. 
25 (B) REGISTRATION NUMBER: 31,678 

(C) REFERENCE/DOCKET NUMBER: FD3288 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 455-5100 

(B) TELEFAX: (619) 455-5110 

30 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL160 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .35 

(D) OTHER INFORMATION: /note- "Where "B" occurs, B - 
inosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCGGAATTCG GBTGGVANVA NTGGRTBRTB KCBCC 
(2) INFORMATION FOR SEQ ID NO:2: r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL153 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CCGGAATTCR CADSCRCADC YNBTDGYDRY CAT 
(2) INFORMATION FOR SEQ ID NO: 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: GDF-9 

(ix) FEATURE: 
10 (A) NAME/KEY: CDS 

(B) LOCATION: 29. .1351 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



15 



20 



ATGCGTTCCT TCTTAGTTCT TCCAAGTC ATG GCA CTT CCC AGC AAC TTC CTG sf 

Met Ala Leu Pro Ser Asn Phe Leu 
1 5 



TTG GGG GTT TGC TGC TTT GCC TGG CTG TGT TTT CTT AGT AGC CTT AGC 
Leu Gly Val Cys Cys Phe Ala Trp Leu Cys Phe Leu Ser Ser Leu Ser 
10 15 20 



10O 



TCT CAG GCT TCT ACT GAA GAA TCC CAG AGT GGA GCC AGT GAA AAT GTG 148? 
Ser Gin Ala Ser Thr Glu Glu Ser Gin Ser Gly Ala Ser Glu Asn Val 
25 30 35 40 

GAG TCT GAG GCA GAC CCC TGG TCC TTG CTG CTG CCT GTA GAT GGG ACT 196 
Glu Ser Glu Ala Asp Pro Trp Ser Leu Leu Leu Pro Val Asp Gly Thr 
45 50 55 

25 GAC AGG TCT GGC CTC TTG CCC CCC CTC TTT AAG GTT CTA TCT GAT AGG 244 

Asp Arg Ser Gly Leu Leu Pro Pro Leu Phe Lys Val Leu Ser Asp Arg 
60 65 70 

CGA GGT GAG ACC CCT AAG CTG CAG CCT GAC TCC AGA GCA CTC TAC TAC 292 
Arg Gly Glu Thr Pro Lys Leu Gin Pro Asp Ser Arg Ala Leu Tyr Tyr 
30 75 80 85 
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ATG AAA AAG CTC TAT AAG ACG TAT GCT ACC AAA GAG GGG GTT CCC AAA 340 
Met Lys Lys Leu Tyr Lys Thr Tyr Ala Thr Lys Glu Gly Val Pro Lys 
90 95 100 

CCC AGC AGA AGT CAC CTC TAC AAT ACC GTC CGG CTC TTC AGT CCC TGT 388 
Pro Ser Arg Ser His Leu Tyr Asn Thr Val Arg Leu Phe Ser Pro Cys 
105 110 115 120 

GCC CAG CAA GAG CAG GCA CCC AGC AAC CAG GTG ACA GGA CCG CTG CCG 436 
Ala Gin Gin Glu Gin Ala Pro Ser Asn Gin Val Thr Gly Pro Leu Pro 
125 130 135 

ATG GTG GAC CTG CTG TTT AAC CTG GAC CGG GTG ACT GCC ATG GAA CAC 484 
Met Val Asp Leu Leu Phe Asn Leu Asp Arg Val Thr Ala Met Glu His 
.140 145 150 

TTG CTC AAA TCG GTC TTG CTA TAC ACT CTG AAC AAC TCT GCC TCT TCC 532 
Leu Leu Lys Ser Val Leu Leu Tyr Thr Leu Asn Asn Ser Ala Ser Ser 
155 160 165 

TCC TCC ACT GTG ACC TGT ATG TGT GAC CTT GTG GTA AAG GAG GCC ATG 580 
Ser Ser Thr Val Thr Cys Met Cys Asp Leu Val Val Lys Glu Ala Met 
170 175 180 

TCT TCT GGC AGG GCA CCC CCA AGA GCA CCG TAC TCA TTC ACC CTG AAG 628 
Ser Ser Gly Arg Ala Pro Pro Arg Ala Pro Tyr Ser Phe Thr Leu Lys 
185 190 195 200 

AAA CAC AGA TGG ATT GAG ATT GAT GTG ACC TCC CTC CTT CAG CCC CTA 676 
Lys His Arg Trp He Glu He Asp Val Thr Ser Leu Leu Gin Pro Leu 
205 210 215 

GTG ACC TCC AGC GAG AGG AGC ATT CAC CTG TCT GTC AAT TTT ACA TGC 724 
Val Thr Ser Ser Glu Arg Ser He His Leu Ser Val Asn Phe Thr Cys 
220 225 230 

ACA AAA GAC CAG GTG CCA GAG GAC GGA GTG TTT AGC ATG CCT CTC TCA 772 
Thr Lys Asp Gin Val Pro Glu Asp Gly Val Phe Ser Met Pro Leu Ser 
235 240 245 



GTG CCT CCT TCC CTC ATC TTG TAT CTC AAC GAC ACA AGC ACC CAG GCC 
Val Pro Pro Ser Leu He Leu Tyr Leu Asn Asp Thr Ser Thr Gin Ala 
250 255 260 
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WO 94/15966 



PCT/US94/00685 



-41- 



TAC CAC TCT TGG GAG TCT CTT CAG TCC ACC TGG AGG CCT TTA CAG CAT 
Tyr His Ser Trp Gin Ser Leu Gin Ser Thr Trp Arg Pro Leu Gin His 
265 270 275 280 



868 



CCC GGC CAG GCC GGT GTG GCT GCC CGT CCC GTG AAA GAG GAA GCT ACT 
Pro Gly Gin Ala Gly Val Ala Ala Arg Pro Val Lys Glu Glu Ala Thr 
285 290 295 



916 



GAG GTG GAA AGA TCT CCC CGG CGC CGT CGA GGG CAG AAA GCC ATC CGC 
Glu Val Glu Arg Ser Pro Arg Arg Arg Arg Gly Gin Lys Ala lie Arg 
300 305 310 



964 



10 TCC GAA GCG AAG GGG CCA CTT CTT ACA GCA TCC TTC AAC CTC AGC GAA 

Ser Glu Ala Lys Gly Pro Leu Leu Thr Ala Ser Phe Asn Leu Ser Glu 
315 320 325 



1012 



15 



TAC TTC AAA CAG TTT CTT TTC CCC CAA AAC GAG TGT GAA CTC CAT GAC 
Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu Cys Glu Leu His Asp 
330 335 340 



1060 



TTC AGA CTG AGT TTT AGT CAG CTC AAA TGG GAC AAC TGG ATC GTG GCC 
Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp Asn Trp lie Val Ala 
345 350 355 360 



1108 



CCG CAC AGG TAC AAC CCT AGG TAC TGT AAA GGG GAC TGT CCT AGG GCG 
20 Pro His Arg Tyr Asn Pro Arg Tyr Cys Lys Gly Asp Cys Pro Arg Ala 

365 370 375 



1156 



GTC AGG CAT CGG TAT GGC TCT CCT GTG CAC ACC ATG GTC CAG AAT ATA 
Val Arg His Arg Tyr Gly Ser Pro Val His Thr Met Val Gin Asn lie 
380 385 390 



1204 



25 ^ ATC TAT GAG AAG CTG GAC CCT TCA GTG CCA AGG CCT TCG TGT GTG CCG 
lie Tyr Glu Lys Leu Asp Pro Ser Val Pro Arg Pro Ser Cys Val Pro 
395 400 405 



1252 



30 



GGC AAG TAC AGC CCC CTG AGT GTG TTG ACC ATT GAA CCC GAC GGC TCC 
Gly Lys Tyr Ser Pro Leu Ser Val Leu Thr lie Glu Pro Asp Gly Ser 
410 415 420 



1300 



ATC GCT TAC AAA GAG TAC GAA GAC ATG ATA GCT ACG AGG TGC ACC TGT 
lie Ala Tyr Lys Giu Tyr Glu Asp Met lie Ala Thr Arg Cys Thr Cys 
425 430 435 440 



1348 
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CGT TAGCATGGGG GCCACTTCAA CAAGCCTGCC TGGCAGAGCA ATGCTGTGGG 
Arg 



1401 



CCTTAGAGTG CCTGGGCAGA GAGCTTCCTG TGACCAGTCT CTCCGTGCTG CTCAGTGCAC 
ACTGTGTGAG CGGGGGAAGT GTGTGTGTGT GGATGAGCAC ATCGAGTGCA GTGTCCGTAG 
GTGTAAAGGG CACACTCACT GGTCGTTGCC ATAAACCAAG TGAAATGTAA CTCATTTGGA 
GAGCTCTTTC TCCCCACGAG TGTAGTTTTC AGTGGACAGA TTTGTTAGCA TAAGTCTCGA 
GTAGAATGTA GCTGTGAACA TGTCAGAGTG CTGTGGTTTT ATGTGACGGA AGAATAAACT 
GTTGATGGCA T 



1461 
1521 
1581 
1641 
1701 
1712 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Ala Leu Pro Ser Asn Phe Leu Leu Gly Val Cys Cys Phe Ala Trp 
1 5 10 15 

Leu Cys Phe Leu Ser Ser Leu Ser Ser Gin Ala Ser Thr Glu Glu Ser 
20 25 30 

Gin Ser Gly Ala Ser Glu Asn Val Glu Ser Glu Ala Asp Pro Trp Ser 
35 40 45 

Leu Leu Leu Pro Val Asp Gly Thr Asp Arg Ser Gly Leu Leu Pro Pro 
50 55 60 

Leu Phe Lys Val Leu Ser Asp Arg Arg Gly Glu Thr Pro Lys Leu Gin 
65 70 75 80 

Pro Asp Ser Arg Ala Leu Tyr Tyr Met Lys Lys Leu Tyr Lys Thr Tyr 
85 90 95 
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Ala Thr Lys Glu Gly Val Pro Lys Pro Ser Arg Ser His Leu Tyr Asn 
100 105 110 

Thr Val Arg Leu Phe Ser Pro Cys Ala Gin Gin Glu Gin Ala Pro Ser 
115 120 125 

5 Asn Gin Val Thr Gly Pro Leu Pro Met Val Asp Leu Leu Phe Asn Leu 

130 135 140 

Asp Arg Val Thr Ala Met Glu His Leu Leu Lys Ser Val Leu Leu Tyr 
145 150 155 160 

Thr Leu Asn Asn Ser Ala Ser Ser Ser Ser Thr Val Thr Cys Met Cys 
10 ~ 165 170 175 

Asp Leu Val Val Lys Glu Ala Met Ser Ser Gly Arg Ala Pro Pro Arg 
180 185 190 

Ala Pro Tyr Ser Phe Thr Leu Lys Lys His Arg Trp lie Glu lie Asp 
195 200 205 

15 Val Thr Ser Leu Leu Gin Pro Leu Val Thr Ser Ser Glu Arg Ser lie 

210 215 220 

His Leu Ser Val Asn Phe Thr Cys Thr Lys Asp Gin Val Pro Glu Asp 
225 230 235 240 

Gly Val Phe Ser Met Pro Leu Ser Val Pro Pro Ser Leu lie Leu Tyr 
20 245 250 255 

Leu Asn Asp Thr Ser Thr Gin Ala Tyr His Ser Trp Gin Ser Leu Gin 
260 265 270 

Ser Thr Trp Arg Pro Leu Gin His Pro Gly Gin Ala Gly Val Ala Ala 
275 280 285 

25 Arg Pro Val Lys Glu Glu Ala Thr Glu Val Glu Arg Ser Pro Arg Arg 

290 295 300 

Arg Arg Gly Gin Lys Ala lie Arg Ser Glu Ala Lys Gly Pro Leu Leu 
305 310 315 320 

Thr Ala Ser Phe Asn Leu Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro 
30 325 330 335 
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Gln Asn Glu Cys Glu Leu His Asp Phe Arg Leu Ser Phe Ser Gin Leu 
340 345 350 

Lys Trp Asp Asn Trp He Val Ala Pro His Arg Tyr Asn Pro Arg Tyr 
355 360 365 

Cys Lys Gly Asp Cys Pro Arg Ala Val Arg His Arg Tyr Gly Ser Pro 
370 375 380 

Val His Thr Met Val Gin Asn He He Tyr Glu Lys Leu Asp Pro Ser 
385 390 395 400 

Val Pro Arg Pro Ser Cys Val Pro Gly Lys Tyr Ser Pro Leu Ser Val 
405 410 415 

Leu Thr He Glu Pro Asp Gly Ser He Ala Tyr Lys Glu Tyr Glu Asp 
420 425 430 

Met He Ala Thr Arg Cys Thr Cys Arg 
435 440 

( 2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: GDF-3 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .117 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Arg Arg Ala Ala He Ser Val Pro Lys Gly Phe Cys Arg Asn Phe 
1 5 10 15 
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Cys His Arg His Gin Leu Phe lie Asn Phe Gin Asp Leu Gly Trp His 
20 25 30 

Lys Trp Val lie Ala Pro Lys Gly Phe Met Ala Asn Tyr Cys His Gly 
35 40 45 

5 Glu Cys Pro Phe Ser Met Thr Thr Tyr Leu Asn Ser Ser Asn Tyr Ala 

50 55 60 

Phe Met Gin Ala Leu Met His Met Ala Asp Pro Lys Val Pro Lys Ala 
65 70 75 80 

Val Cys Val Pro Thr Lys Leu Ser Pro He Ser Met Leu Tyr Gin Asp 

*10 . . 85 90 95 

Ser Asp Lys Asn Val lie Leu Arg His Tyr Glu Asp Met Val Val Asp 
100 105 110 

Glu Cys Gly Cys Gly 
115 

15 (2) INFORMATION FOR SEQ ID NO: 6: r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single ^ 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: GDF-9 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Phe Asn Leu Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu 
1 5 10 15 
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Cys Glu Leu His Asp Phe Arg Leu 

20. 

Asn Trp He Val Ala Pro His Arg 
35 40 

5 Asp Cys Pro Arg Ala Val Arg His 

50 55 

Met Val Gin Asn He He Tyr Glu 
65 70 

Pro Ser Cys Val Pro Gly Lys Tyr 
10 85 

Glu Pro Asp Gly Ser He Ala Tyr 
100 

Thr Arg Cys Thr Cys Arg 
115 



Ser Phe Ser Gin Leu Lys Trp Asp 
25 30 

Tyr Asn Pro Arg Tyr Cys Lys Gly 
45 

Arg Tyr Gly Ser Pro Val His Thr 
60 

Lys Leu Asp Pro Ser Val Pro Arg 
75 80 

Ser Pro Leu Ser Val Leu Thr He 
90 95 

Lys Glu Tyr Glu Asp Met He Ala 
105 HO 



15 (2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: GDF-1 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein- 

(B) LOCATION: 1. .122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly Ala 
15 10 15 
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Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp His 
20 25 30 

Arg Trp Val lie Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly 

.:: 35 40 45 

5 Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala 

50 55 60 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Gly 
65 70 75 80 

Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser 
110 85 90 95 

Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr Glu 
100 105 110 

Asp Met Val Val Asp Glu Cys Gly Cys Arg 

115 120 - 

15 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: Vg-1 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Arg Arg.Lys Arg Ser Tyr Ser Lys Leu Pro Phe Thr Ala Ser Asn lie 
I 5 10 15 
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Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
20 25 30 

Asn Trp Val lie Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 
35 40 45 

5 Glu Cys Pro Tyr Pro Leu Thr Glu lie Leu Asn Gly Ser Asn His Ala 

50 55 60 

lie Leu Gin Thr Leu Val His Ser lie Glu Pro Glu Asp lie Pro Leu 
65 70 75 80 

Pro Cys Cys Val Pro Thr Lys Met Ser Pro lie Ser Met Leu Phe Tyr 
10 85 90 95 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
100 105 110 

Asp Glu Cys Gly Cys Arg 
115 

15 (2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: Vgr-1 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser Ser Glu Leu Lys Thr Ala 
15 10 15 
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Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
20 25 30 

Asp Trp lie lie Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
35 40 45 

5 Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 

50 55 60 

lie Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
65 70 75 ftO 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe 
10 85 90 95 

Asp Asp Asn Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val 
100 105 110 

Arg Ala Cys Gly Cys His 
115 

15 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

20 (D) TOPOLOGY: linear : 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: OF-1 

(ix) FEATURE : 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gin Arg Gin Ala 
1 5 10 15 
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Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
20 . 25 30 

Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly 
35 40 45 

5 Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His Ala 

50 55 60 

He Val Gin Thr Leu Val His Phe He Asn Pro Glu Thr Val Pro Lys 
65 70 75 80 

Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser Val Leu Tyr Phe 
10 85 90 95 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
100 105 110 

Arg Ala Cys Gly Cys His 
115 

15 (2) INFORMATION FOR SEQ ID N0:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP- 5 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. . 118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gin Lys Gin Ala 
1 5 10 ' 15 
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Cys Lys Lys His Glu Leu Tyr Val 
20 

Asp Trp lie He Ala Pro Glu Gly 

- 35 40 

5 Glu Cys Ser Phe Pro Leu Asn Ala 

50 55 

He Val Gin Thr Leu Val His Leu 
65 70 

Pro Cys Cys Ala Pro Thr Lys Leu 
10 85 

Asp Asp Ser Ser Asn Val He Leu 
100 



Ser Phe Arg Asp Leu Gly Trp Gin 
25 .30 

Tyr Ala Ala Phe Tyr Cys Asp Gly 
45. 

His Met Asn Ala Thr Asn His Ala 
60 

Met Phe Pro Asp His Val Pro Lys 
75 80 

Asn Ala He Ser Val Leu Tyr Phe 
90 95 

Lys Lys Tyr Arg Asn Met Val Val 
105 110 



Arg Ser Cys Gly Cys His 
115 



15 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
20 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: 60A 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg Ser 
1 5 10 15 
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Cys Gin Met Gin Thr Leu Tyr lie Asp Phe Lys Asp Leu Gly Xrp His 
20 25 30 

Asp Trp lie lie Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser Gly 
35 40 45 

5 Glu Cys Asn Phe Pro Leu Asn Ala His Met: Asn Ala Thr Asn His Ala 

50 55 60 

lie Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro Lys 
65 70 75 80 

Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr His 
10 85 90 95 



Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met lie Val 
100 105 110 

Lys Ser Cys Gly Cys His 
115 

15 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP- 2 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .117 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Glu Lys Arg Gin Ala Lys His Lys Gin Arg Lys Arg Leu Lys Ser Ser 
1 .5 10 15 
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Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
20 25 30 

Asp Trp He Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly 
35 AO ; 45 

5 Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 

50 55 .60 

He Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys Ala 
65 70 75 80 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp 
10 85 90 95 

Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu 
. 100 105 HO 

Gly Cys Gly Cys Arg 
115 

15 (2) INFORMATION FOR SEQ ID NO: 14: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



20 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP-4 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1..117 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn 
■ l ' 5 10 15 
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Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
20 25 30 

Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly 
35 40 45 

5 Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 

50 55 60 

lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser lie Pro Lys Ala 
65 70 75 80 

Cys Cys Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu Tyr Leu Asp 
10 85 90 95 

Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu 
100 105 110 

Gly Cys Gly Cys Arg 
115 

15 (2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: DPP 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Lys 
1 



Arg His Ala Arg Arg Pro Thr Arg Arg Lys Asn His Asp Asp Thr 
5 10 15 
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Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asp 
20 25 30 

Asp Trp He Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
35 40 ^5 

5 Lys Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser Thr Asn His Ala 

50 55 60 

Val Val Gin Thr Leu Val Asn Asn Met Asn Pro Gly Lys Val Pro Lys 
65 70 75 80 

Ala Cys Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met Leu Tyr Leu 
10 85 90 95 

Asn Asp Gin Ser Thr Val Val Leu Lys Asn Tyr Gin Glu Met Thr Val 
100 105 HO 

Val Gly Cys Gly Cys Arg 
115 

15 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP- 3 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1..119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gin Thr Leu Lys Lys Ala Arg Arg Lys Gin Trp He Glu Pro Arg Asn 
• l 5 .10 15 



BNSDOCID: <WO 9415966A1 I > 



BNS oaae 5', 



WO 94/15966 



PCT/US94/00685 



-56- 



Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 
20 25 30 

Glu Trp He He Ser Pro Lys. Ser Phe Asp Ala Tyr Tyr Cys Ser Gly 
35 40 45 

5 Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 

50 55 60 

Thr He Gin Ser He Val Arg Ala Val Gly Val Val Pro Gly lie Pro 
65 70 75 80 

Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser He Leu Phe 
10 85 90 95 

Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met Thr 
100 105 HO 

Val Glu Ser Cys Ala Cys Arg 
115 

15 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: MIS 



(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. . 115 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Pro 
1 



Gly Arg Ala Gin Arg Ser Ala Gly Ala Thr Ala Ala Asp Gly Pro 
5 10 15 
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Cys Ala Leu Arg Glu Leu Ser Val Asp Leu Arg Ala Glu ferg Ser Val 
20 25 30 

Leu lie Pro Glu Thr Tyr Gin Ala Asn Asn Cys Gin Gly Val Cys Gly 
35 40 45 

5 Trp Pro Gin Ser Asp Arg Asn Pro Arg Tyr Gly Asn His Val Val Leu 

50 55 60 

Leu Leu Lys Met Gin Ala Arg Gly Ala Ala Leu Ala Arg Pro Pro Cys 
65 70 75 80 

Cys Val Pro Thr Ala Tyr Ala Gly Lys Leu Leu lie Ser Leu Ser Glu 
10 85 90 95 

Glu Arg lie Ser Ala His His Val Pro Asn Met Val Ala Thr Glu Cys 
100 105 110 

Gly Cys Arg 

115 * v 
15 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhibin alpha 

(ix) FEATURE: 
.25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Leu Arg Leu Leu Gin Arg Pro Pro Glu Glu Pro Ala Ala His Ala Asn 
15 10 15 
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Cys His Arg Val Ala Leu Asn lie Ser Phe Gin Glu Leu Gly Trp Glu 

20 .25 30 

Arg Trp He Val Tyr Pro Pro Ser Phe He Phe His Tyr Cys His Gly 

35 .40 45 

5 Gly Cys Gly Leu His He Pro Pro Asn Leu Ser Leu Pro Val Pro Gly 

50 55 60 

Ala Pro Pro Thr Pro Ala Gin Pro Tyr Ser Leu Leu Pro Gly Ala Gin 
65 70 75 80 

Pro Cys Cys Ala Ala Leu Pro Gly Thr Met Arg Pro Leu His Val Arg 
10 85 90 95 

Thr Thr Ser Asp Gly Gly Tyr Ser Phe Lys Tyr Glu Thr Val Pro Asn 
100 105 HO 

Leu Leu Thr Gin His Cys Ala Cys He 
115 120 



15 (2) INFORMATION FOR SEQ ID N0:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhibin betaA 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Arg Arg Arg Arg Arg Gly Leu Glu Cys Asp Gly Lys Val Asn He Cys 
1 5 10 15 
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Cys Lys Lys Gin Phe Phe Val Ser Phe Lys Asp lie Gly Trp Asn Asp 
20 25 30 

Trp lie lie Ala Pro Ser Gly Tyr His Ala Asn Tyr Cys Glu Gly Glu 

- 35 40 45 

5 Cys Pro Ser His lie Ala Gly Thr Ser Gly Ser Ser Leu Ser Phe His 

50 55 60 

Ser Thr Val lie Asn His Tyr Arg Met Arg Gly His Ser Pro Phe Ala 
65 70 75 80 

Asn Leu Lys Ser Cys Cys Val Pro Thr Lys Leu Arg Pro Met Ser Met 
10 85 90 95 

Leu Tyr Tyr Asp Asp Gly Gin Asn lie lie Lys Lys Asp lie Gin Asn 
100 105 110 

Met He Val Glu Glu Cys Gly Cys Ser 
115 120 

15 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhibin betaB 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Arg He Arg Lys Arg Gly Leu Glu Cys Asp Gly Arg Thr Asn Leu Cys 
15 10 15 
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Cys Arg Gin Gin Phe Phe lie Asp Phe Arg Leu lie Gly Trp Asn Asp 

20 .25 30 

Trp lie lie Ala Pro Thr Gly Tyr Tyr Gly Asn Tyr Cys Glu Gly Ser 
35 40 45 

5 Cys Pro Ala Tyr Leu Ala Gly Val Pro Gly Ser Ala Ser Ser Phe His 

50 55 60 

Thr Ala Val Val Asn Gin Tyr Arg Met Arg Gly Leu Asn Pro Gly Thr 
65 70 75 80 

Val Asn Ser Cys Cys lie Pro Thr Lys Leu Ser Thr Met Ser Met Leu 
10 85 90 95 

Tyr Phe Asp Asp Glu Tyr Asn lie Val Lys Arg Asp Val Pro Asn Met 
100 105 110 

lie Val Glu Glu Cys Gly Cys Ala 
115 120 

15 (2) INFORMATION FOR SEQ ID NO; 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

<B) CLONE: TGF-betal 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1..114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Arg Arg Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser Thr Glu Lys Asn 
15 10 15 
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Cys Cys Val Arg Gin Leu Tyr He Asp Phe Arg Lys Asp Leu Gly Trp 
20 25 30 

Lys Trp He His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu Gly 

- 35 * 40 45 

5 Pro Cys Pro Tyr He Trp Ser Leu Asp Thr Gin Tyr Ser Lys Val Leu 

50 55 60 

Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys 
65 70 75 80 

Val Pro Gin Ala Leu Glu Pro Leu Pro He Val Tyr Tyr Val Gly Arg 
10 85 90 95 

Lys Pro Lys Val Glu Gin Leu Ser Asn Met He Val Arg Ser Cys Lys 
100 105 HO 

Cys Ser 



15 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta2 

(ix) FEATURE : 
25 (A) NAME /KEY : Protein 

(B) LOCATION: 1. .114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Lys Arg Ala Leu Asp Ala Ala Tyr Cys Phe Arg Asn Val Gin Asp Asn 
! 1 5 " 10 15 
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Cys Cys Leu Arg Pro Leu Tyr lie Asp Phe Lys Arg Asp Leu Gly Trp 
20 25 30 

Lys Trp lie His Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala Gly 
35 40 45 

5 Ala Cys Pro Tyr Leu Trp Ser Ser Asp Thr Gin His Ser Arg Val Leu 

50 55 60 

Ser Leu Tyr Asn Thr lie Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
65 70 75 80 

Val Ser Gin Asp Leu Glu Pro Leu Thr lie Leu Tyr Tyr lie Gly Lys 
10 85 90 95 

Thr Pro Lys lie Glu Gin Leu Ser Asn Met lie Val Lys Ser Cys Lys 
100 105 110 

Cys Ser 



15 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta3 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



Lys Arg Ala Leu Asp Thr Asn Tyr Cys Phe Arg Asn Leu Glu Glu Asn 
1 5 10 15 
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Cys Cys Val Arg Pro Leu Tyr lie Asp Phe Arg Gin Asp Leu Gly Trp 
20 25 30 



Lys Trp Val His Glu Pro Lys Gly Tyr Tyr Ala Asn Phe Cys Ser Gly 
35 40 45 



5 Pro Cys Pro Tyr Leu Arg Ser Ala Asp Thr Thr His Ser Thr Val Leu 

50 55 60 

Gly Leu Tyr Asn Thr Leu Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
65 70 75 80 

Val Pro Gin Asp Leu Glu Pro Leu Thr lie Leu Tyr Tyr Val Gly Arg 

10 85 90 95 



Thr Pro Lys Val Glu Gin Leu Ser Asn Met Val Val Lys Ser Cys Lys 
100 105 110 



Cys Ser 



15 (2) INFORMATION FOR SEQ ID NO : 24 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta4 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .116 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Arg Arg Asp Leu Asp Thr Asp Tyr Cys Phe Gly Pro Gly Thr Asp Glu 
1 5 10 15 
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Lys Asn Cys Cys Val Arg Pro Leu Tyr lie Asp Phe Arg Lys Asp Leu 
20 25 30 

Gin Trp Lys Trp lie His Glu Pro Lys Gly Tyr Met Ala Asn Phe Cys 
35 40 45 

5 Met Gly Pro Cys Pro Tyr lie Trp Ser Ala Asp Thr Gin Tyr Thr Lys 

50 55 60 

Val Leu Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro 
65 70 75 80 

Cys Cys Val Pro Gin Thr Leu Asp Pro Leu Pro lie lie Tyr Tyr Val 
10 85 90 95 

Gly Arg Asn Val Arg Val Glu Gin Leu Ser Asn Met Val Val Arg Ala 
100 105 110 

Cys Lys Cys Ser 
115 

15 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta5 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Lys 
1 



Arg Gly Val Gly Gin Glu Tyr Cys Phe Gly Asn Asn Gly Pro Asn 
5 10 15 
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Cys Cys Val Lys Pro Leu Tyr lie Asn Phe Arg Lys Asp Leu Gly Trp 
20 25 30 

Lys Trp lie His Glu Pro Lys Gly Tyr Glu Ala Asn Tyr Cys Leu Gly 
35 40 45 

5 Asn Cys Pro Tyr lie Trp Ser Met Asp Thr Gin Tyr Ser Lys Val Leu 

50 55 60 

Ser Leu Tyr Asn Gin Asn Asn Pro Gly Ala Ser lie Ser Pro Cys Cys 
65 70 75 80 

Val Pro Asp Val Leu Glu Pro Leu Pro lie He Tyr Tyr Val Gly Arg 
10 85 90 95 

Thr Ala Lys Val Glu Gin Leu Ser Asn Met Val Val Arg Ser Cys Asn 
100 105 110 

Cys Ser 



15 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: HUMAN GDF-9 

(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1. .454 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26 ; 
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Met Ala Arg Pro Asn Lys Phe Leu Leu Trp Phe Cys Cys Phe Ala Trp 
1 5 10 15 

Leu Cys Phe Pro lie Ser Leu Gly Ser Gin Ala Ser Gly Gly Glu Ala 
20 25 30 

5 Gin lie Ala Ala Ser Ala Glu Leu Glu Ser Gly Ala Met Pro Trp Ser 

35 40 45 

Leu Leu Gin His lie Asp Glu Arg Asp Arg Ala Gly Leu Leu Pro Ala 
50 55 60 

Leu Phe Lys Val Leu Ser Val Gly Arg Gly Gly Ser Pro Arg Leu Gin 
10 65 70 75 80 

Pro Asp Ser Arg Ala Leu His Tyr Met Lys Lys Leu Tyr Lys Thr Tyr 
85 90 95 

Ala Thr Lys Glu Gly lie Pro Lys Ser Asn Arg Ser His Leu Tyr Asn 
100 105 110 

15 Thr Val Arg Leu Phe Thr Pro Cys Thr Arg His Lys Gin Ala Pro Gly 

115 120 125 

Asp Gin Val Thr Gly lie Leu Pro Ser Val. Glu Leu Leu Phe. Asn Leu 
130 135 140 

Asp Arg lie Thr Thr Val Glu His Leu Leu Lys Ser Val Leu Leu. Tyr 
20 145 150 155 160 

Asn lie Asn Asn Ser Val Ser Phe Ser Ser Ala Val Lys Cys Val Cys 
165 170 175 

Asn Leu Met lie Lys Glu Pro Lys Ser Ser Ser Arg Thr Leu Gly Arg 
180 185 190 



25 Ala Pro Tyr Ser Phe Thr Phe Asn Ser Gin Phe Glu Phe Gly Lys Lys 

195 200 205 

His Lys Trp lie Gin lie Asp Val Thr Ser Leu Leu Gin Pro Leu Val 
210 215 220 

Ala Ser Asn Lys Arg Ser lie His Met Ser lie Asn Phe Thr Cys Met 
30 225 230 235 240 
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Lys Asp Gin Leu Glu His Pro Ser Ala Gin Asn Gly Leu Phe Asm Met 
245 250 255 

Thr Leu Val Ser Pro Ser Leu lie Leu Tyr Leu Asn Asp Thr Ser Ala 
260- 265 270 

5 Gin Ala Tyr His Ser Trp Tyr Ser Leu His Tyr Lys Arg Arg Pro Ser 

275 280 285 

Gin Gly Pro Asp Gin Glu Arg Ser Leu Ser Ala Tyr Pro Val Gly Glu 
290 295 300 

Glu Ala Ala Glu Asp Gly Arg Ser Ser His His Arg His Arg Arg Gly 
10 305 310 315 320 

Gin Glu Thr Val Ser Ser Glu Leu Lys Lys Pro Leu Gly Pro Ala Ser 
325 330 335 

Phe Asn Leu Ser Glu Tyr Phe Arg Gin Phe Leu Leu Pro Gin Asn Glu 

340 345 350 J * 

15 Cys Glu Leu His Asp Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp 

355 360 365 

Asn Trp He Val Ala Pro His Arg Tyr Asn Pro Arg Tyr Cys Lys Gly 
370 375 380 

Asp Cys Pro Arg Ala Val Gly His Arg Tyr Gly Ser Pro Val His Thr 
20 385 390 395 400 

Met Val Gin Asn He He Tyr Glu Lys Leu Asp Ser Ser Val Pro Arg 
405 410 415 

Pro Ser Cys Val Pro Ala Lys Tyr Ser Pro Leu Ser Val Leu Thr He 
420 425 430 

25 Glu Pro Asp Gly Ser He Ala Tyr Lys Glu Tyr Glu Asp Met He Ala 

435 440 445 



Thr Lys Cys Thr Cys Arg 
450 
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CLAIMS 

• 1 . Substantially pure growth differentiation factor-9 (GDF-9) and functional 
fragments thereof. 

2. An isolated polynucleotide sequence encoding the GDF-9 polypeptide 
of claim 1 . 

3. The polynucleotide sequence of claim 2, wherein the polynucleotide is 
isolated from a mammalian ceil. 

4. The polynucleotide of claim 3, wherein the mammalian cell is selected 
from the group consisting of mouse, rat, and human cell. 

5. An expression vector including the polynucleotide of claim 2. 

6. The vector of claim 5, wherein the vector is a plasmid. 

7. The vector of claim 5, wherein the vector is a virus. 

8. A host cell stably transformed with the vector of claim 5. 

9. The host cell of claim 8, wherein the cell is prokaryotic. 

1 0. The host cell of claim 8, wherein the cell is eukaryotic. 

1 1 . Antibodies reactive with the polypeptide of claim 1 or fragments thereof. 

12. The antibodies of claim 11, wherein the antibodies are polyclonal. 
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13. The antibodies of claim 11, wherein the antibodies are monoclonal. 

14. A method of detecting a cell proliferative disorder comprising 
contacting the antibody of claim 11 with a specimen of a subject 
suspected of having a GDF-9 associated disorder and detecting binding 
of the antibody. 

15. The method of claim 14, wherein the cell proliferative disorder is an 
ovarian tumor. 

1 6. The method of claim 1 4, wherein the detecting is in vivo. 

17. The method of claim 16, wherein the antibody is detectably : labeled. 

18. The method of claim 17, wherein the detectable label is selected from 
the group consisting of a radioisotope, a fluorescent compound, a 
bioluminescent compound and a chemiluminescent compound. 

1 9. The method of claim 1 4 f wherein the detection is in vitro. 

20. The method of claim 19, wherein the antibody is detectably labeled. 

21 . The method of claim 20, wherein the label is selected from the group 
consisting of a radioisotope, a fluorescent compound, a bioluminescent 
compound, a chemoluminescent compound and an enzyme. 

22. A method of treating a cell proliferative disorder associated with 
expression of GDF-9, comprising contacting the cells with a reagent 
which suppresses the GDF-9 activity. 
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23. The method of claim 22, wherein the reagent is an anti-GDF-9 antibody. 

24. The method of claim 22, wherein the reagent is a GDF-9 antisense 
sequence. 

25. The method of claim 22, wherein the cell proliferative disorder is an 
ovarian tumor. 

26. The method of claim 22, wherein the reagent which suppresses GDF-9 
activity is introduced to a cell using a vector. 

27. The method of claim 26, wherein the vector is a colloidal dispersion 
system. 

28. The method of claim 27, wherein the colloidal dispersion system is a 
liposome. 

29. The method of claim 28, wherein the liposome is essentially target 
specific. 

30. The method of claim 29, wherein the liposome is anatomically targeted. 

31 . The method of claim 29, wherein the liposome is mechanistically 
targeted. 

32. The method of claim 31 , wherein the mechanistic targeting is passive, 

33. The method of claim 31 , wherein the mechanistic targeting is active. 
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34. The method of claim 33, wherein the liposome is actively targeted by 
coupling with a moiety selected from the group consisting of a sugar, 
a glycolipid, and a protein. 

35. The method of claim 34, wherein the protein moiety is an antibody. 

36. The method of claim 35, wherein the vector is a virus. 

37. The method of claim 36, wherein the virus is an RNA virus. 

38. The method of claim 37, wherein the RNA virus is a retrovirus. 

39. The method of claim 38, wherein the retrovirus is essentially, target 
specific. 
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